779 research outputs found

    Application of compiler-assisted multiple instruction rollback recovery to speculative execution

    Get PDF
    Speculative execution is a method to increase instruction level parallelism which can be exploited by both super-scalar and VLIW architectures. The key to a successful general speculation strategy is a repair mechanism to handle mispredicted branches and accurate reporting of exceptions for speculated instructions. Multiple instruction rollback is a technique developed for recovery from transient processor failure. Many of the difficulties encountered during recovery from branch misprediction or from instruction re-execution due to exception in a speculative execution architecture are similar to those encountered during multiple instruction rollback. The applicability of a recently developed compiler-assisted multiple instruction rollback scheme to aid in speculative execution repair is investigated. Extensions to the compiler-assisted scheme to support branch and exception repair are presented along with performance measurements across ten application programs

    Two-dimensional tetramer-cuprate Na5RbCu4(AsO4)4Cl2: phase transitions and AFMorder as seen by 87Rb NMR

    Full text link
    We report the Rb nuclear magnetic resonance (NMR) results in a recently synthesized Na5RbCu4(AsO4)Cl2. This complex novel two-dimensional (2D) cuprate is an unique magnetic material, which contains layers of coupled Cu4O4 tetramers. In zero applied magnetic field, it orders antiferromagnetically via a second-order low-entropy phase transition at TN = 15(1) K. We characterise the ordered state by 87Rb NMR, and suggest for it a noncollinear rather than collinear arrangement of spins. We discuss the properties of Rb nuclear site and point out the new structural phase transition(s) around 74 K and 110 K.Comment: 2 pages, 2 figures, Proceedings of SCES'05, Vienna 200

    Array concepts for solid-state and vacuum microelectronics millimeter-wave generation

    Get PDF
    The authors have proposed that the increasing demand for contact watt-level coherent sources in the millimeter- and submillimeter-wave region can be satisfied by fabricating two-dimensional grids loaded with oscillators and multipliers for quasi-optical coherent spatial combining of the outputs of large numbers of low-power devices. This was first demonstrated through the successful fabrication of monolithic arrays with 2000 Schottky diodes. Watt-level power outputs were obtained in doubling to 66 GHz. In addition, a simple transmission-line model was verified with a quasi-optical reflectometer that measured the array impedance. This multiplier array work is being extended to novel tripler configurations using blocking barrier devices. The technique has also been extended to oscillator configurations where the grid structure is loaded with negative-resistance devices. This was first demonstrated using Gunn devices. More recently, a 25-element MESFET grid oscillating at 10 GHz exhibited power combining and self-locking. Currently, this approach is being extended to a 100-element monolithic array of Gunn diodes. This same approach should be applicable to planar vacuum electron devices such as the submillimeter-wave BWO (backward wave oscillator) and vacuum FET

    Millimeter and submillimeter wave technology developments for the next generation of fusion devices

    Get PDF
    There is increasing demand for compact watt-level coherent sources in the millimeter and submillimeter wave region. The approach that we have taken to satisfy this need is to fabricate two-dimensional grids loaded with oscillators, electronic beam steerers, and frequency multipliers for quasioptical coherent spatial combining of the outputs of a large number of low-power devices

    Exposing errors related to weak memory in GPU applications

    Get PDF
    © 2016 ACM.We present the systematic design of a testing environment that uses stressing and fuzzing to reveal errors in GPU applications that arise due to weak memory effects. We evaluate our approach on seven GPUS spanning three NVIDIA architectures, across ten CUDA applications that use fine-grained concurrency. Our results show that applications that rarely or never exhibit errors related to weak memory when executed natively can readily exhibit these errors when executed in our testing environment. Our testing environment also provides a means to help identify the root causes of such errors, and automatically suggests how to insert fences that harden an application against weak memory bugs. To understand the cost of GPU fences, we benchmark applications with fences provided by the hardening strategy as well as a more conservative, sound fencing strategy

    GPU Concurrency: Weak Behaviours and Programming Assumptions

    Get PDF
    Concurrency is pervasive and perplexing, particularly on graphics processing units (GPUs). Current specifications of languages and hardware are inconclusive; thus programmers often rely on folklore assumptions when writing software. To remedy this state of affairs, we conducted a large empirical study of the concurrent behaviour of deployed GPUs. Armed with litmus tests (i.e. short concurrent programs), we questioned the assumptions in programming guides and vendor documentation about the guarantees provided by hardware. We developed a tool to generate thousands of litmus tests and run them under stressful workloads. We observed a litany of previously elusive weak behaviours, and exposed folklore beliefs about GPU programming---often supported by official tutorials---as false. As a way forward, we propose a model of Nvidia GPU hardware, which correctly models every behaviour witnessed in our experiments. The model is a variant of SPARC Relaxed Memory Order (RMO), structured following the GPU concurrency hierarchy

    Application of Compiler-Assisted Multiple Instruction Rollback Recovery to Speculative Execution

    Get PDF
    Coordinated Science Laboratory was formerly known as Control Systems LaboratoryNational Aeronautics and Space Administration / NASA NAG 1-613Department of the Navy managed by the Office of the Chief of Naval Research / N00014-91-J-128

    Single-Pass Memory System Evaluation for Multiprogramming Workloads

    Get PDF
    Coordinated Science Laboratory was formerly known as Control Systems LaboratoryNational Science Foundation (NSF) / MIP-8809478NCRNational Aeronautics and Space Administration (NASA) / NASA NAG 1-613Office of Naval Research / N00014-88-K-065

    The Susceptibility of Programs to Context Switching

    Get PDF
    Coordinated Science Laboratory was formerly known as Control Systems LaboratoryNational Science Foundation / MIP-8809478NCR Corp.AMD Corp. 29K Advanced Processor Development DivisionNational Aeronautics and Space Administration / NASA NAG 1-613Office of Naval Research / N00014-88-K-0656Hewlett-Packard Co
    • …
    corecore